Encoding Co-occurrence of Features in the HMAX Model
نویسندگان
چکیده
We introduce a method for encoding co-occurrence of features in the HMAX model of visual recognition, and conduct a series of experiments to investigate the contribution of co-occurrence towards better recognition performance. We show that classification accuracy is increased by adding a higher-order layer to the HMAX processing hierarchy, whereby co-occurrence of features is encoded as a new dictionary of features. We show that concatenation of mean pooling, max pooling and co-occurrence information results in better classification results on three datasets (Caltech101, a subset of Caltech256, and TMSI Underwater Images). Overall, we show that incorporating co-occurrence statistics into a biologically-inspired model of visual recognition provides a boost in classification performance above that produced by incorporating occurrence statistics alone.
منابع مشابه
A Biological Model of Object Recognition with Feature Learning
Previous biological models of object recognition in cortex have been evaluated using idealized scenes and have hard-coded features, such as the HMAX model by Riesenhuber and Poggio [10]. Because HMAX uses the same set of features for all object classes, it does not perform well in the task of detecting a target object in clutter. This thesis presents a new model that integrates learning of obje...
متن کاملImage retrieval using the combination of text-based and content-based algorithms
Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for the image retrieval based on the combination of text-based and content-based features. For text-based features, keywords and for content-based features, color and texture features have been used. Query in this system contains some keywords and an input...
متن کاملReceptive Field Encoding Model for Dynamic Natural Vision
Introduction: Encoding models are used to predict human brain activity in response to sensory stimuli. The purpose of these models is to explain how sensory information represent in the brain. Convolutional neural networks trained by images are capable of encoding magnetic resonance imaging data of humans viewing natural images. Considering the hemodynamic response function, these networks are ...
متن کاملVisual dictionaries as intermediate features in the human brain
The human visual system is assumed to transform low level visual features to object and scene representations via features of intermediate complexity. How the brain computationally represents intermediate features is still unclear. To further elucidate this, we compared the biologically plausible HMAX model and Bag of Words (BoW) model from computer vision. Both these computational models use v...
متن کاملEnhanced HMAX model with feedforward feature learning for multiclass categorization
In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and functions of V1 to posterior inferotemporal (PIT) layer of the primate visual cortex, which could ...
متن کامل